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ABSTRACT 

Transposable elements possess specific patterns 
of integration. The biological impact of these 
integration profiles is not well understood. Tf1, 
a long-terminal repeat retrotransposon in 
Schizosaccharomyces pombe, integrates into pro- 
moters with a preference for the promoters of 
stress response genes. To determine the biolo- 
gical significance of Tf1 integration, we took ad- 
vantage of saturated maps of insertion activity and 
studied how integration at hot spots affected the 
expression of the adjacent genes. Our study 
revealed that Tf1 integration did not reduce gene 
expression. Importantly, the insertions activated 
the expression of 6 of 32 genes tested. We 
found that Tf1 increased gene expression by in- 
serting enhancer activity. Interestingly, the 
enhancer activity of Tf1 could be limited by 
Abp1, a host surveillance factor that sequesters 
transposon sequences into structures containing 
histone deacetylases. We found the Tf1 promoter 
was activated by heat treatment and, remarkably, 
only genes that themselves were induced by heat 
could be activated by Tf1 integration, suggesting a 
synergy of Tf1 enhancer sequence with the stress 
response elements of target promoters. We 
propose that the integration preference of Tf1 for 
the promoters of stress response genes and the 
ability of Tf1 to enhance the expression of these 
genes co-evolved to promote the survival of cells 
under stress. 



INTRODUCTION 

Transposons are mobile genetic elements that are wide- 
spread among eukaryotes and comprise a substantial 
portion of eukaryotic genomes. Yet transposon integra- 
tion has the potential to interrupt coding sequences 
throughout the genome of the host cell. They are often 
said to be 'junk' DNA or parasites of the host genome, 
because they make no obvious contribution to the 
cell (1-3). To defend against these deleterious effects, 
the host cells have evolved multiple mechanisms to 
silence transposons including DNA methylation and 
small RNA inhibition mechanisms (4-7). However, 
accumulating evidence demonstrates that the transcription 
of transposons can be activated when cells are under stress 
(8-10). Environmental stress has been shown to induce 
integration events (8). The possibility that transposon in- 
sertions triggered by conditions of stress may benefit the 
host by improving survival is an intriguing hypothesis put 
forward by McClintock (11). However, this model 
remains unsubstantiated. 

In Saccharomyces cerevisiae, the long-terminal repeat 
(LTR) retrotransposons integrate into gene-free positions 
of the chromosomes. Tyl has a strong preference for 
integrating into regions spanning 700 bp upstream of pol 
III transcribed genes, whereas Ty3 integrates one or two 
nucleotides upstream of the transcription start sites of 
tRNA genes (12-15). Although the impact of Tyl and 
Ty3 integration on the transcription of pol III genes has 
not been explored genome wide, these elements do 
increase expression of the limited number of pol III 
transcribed genes tested (16,17). 

In contrast to the integration of Tyl and Ty3 upstream 
of pol III transcribed genes, the LTR retrotransposon Tf 1 
of Schizosaccharomyces pombe integrates into the 
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promoters of pol II transcribed genes (18-21). Saturated 
profiles of insertion sites revealed that Tf 1 integrates with 
a preference for pol II promoters that are induced by en- 
vironmental stresses (21). To determine the biological 
impact of integration, we examined the effect of Tf 1 inte- 
gration on the expression of adjacent genes. We studied 32 
genes often targeted by Tf 1 and found that integration did 
not reduce their expression. In six cases, Tfl insertion 
actually increased the expression of adjacent genes by 
enhancing the levels of the native transcripts. In other 
cases, host factors that participate in genome surveillance 
such as Upfl and Abpl were found to restrict the expres- 
sion of genes that would otherwise have been enhanced by 
Tfl insertion. We found that Tfl transcription was 
induced by heat treatment, and interestingly, only genes 
that themselves were induced by heat could be activated 
by Tfl integration. We propose that it is the synergy of 
Tfl enhancer sequence with the stress response elements of 
target promoters that results in gene activation. We specu- 
late that Tfl integration has the potential to improve the 
survival of individual cells exposed to environmental 
stress. 

MATERIALS AND METHODS 

Media and growth of S. pombe 

Selective media consisted of Edinburgh minimal 2 media 
(EMM) (22) with 2g of a dropout mix, which contained 
all amino acids, 250 ug/ml adenine and uracil but lacked 
nutrients that were absent for selection. A liter of rich 
media, Yeast Extract Plus Supplements (YES), contained 
2g of dropout powder, 30 g glucose and 5g Difco yeast 
extract. Vitamin Bl (10 uM) was added to repress the nmtl 
promoter when indicated. EMM-5-fluoroorotic acid 
(5-FOA) plates contained 1 mg/ml 5-FOA (#F5050, 
United States Biologicals, Swampscott, MA). YES-5- 
FOA-G418 plates contained 500 ug/ml (corrected for 
purity) of Geneticin (#11811-031, Life Technologies, 
Rockville, MD) and 1 mg/ml 5-FOA. All 5. pombe 
strains were grown at 32°C. Cells were grown under 
stress conditions similar to those described previously 
(9). For heat stress, cells were transferred from 32° C to 
a pre-warmed flask in a 39°C water bath, and the cultures 
were incubated for 10 min to allow the media to reach 
39°C. The cultures were then incubated for the designated 
time intervals. To generate oxidative stress, hydrogen 
peroxide (H 2 0 2 , Sigma) was added to a final concentra- 
tion of 0.5 mM. To create osmotic stress, a final concen- 
tration of 1 M sorbitol (Sigma) was added to the cultures. 
For heavy metal stress, cadmium sulfate (CdS0 4 , Sigma) 
was added to a final concentration of 0.5 mM. 

Yeast strains and plasmid construction 

The yeast strains and plasmids used in this study are listed 
in Supplementary Tables SI and S2, respectively. The 
oligonucleotide sequences are listed in Supplementary 
Table S3. The upfl gene was deleted from YHL1101 
using the drug-resistance marker, natl, encoding 
nourseothricin (NAT) acetyltransferase (23). A DNA 
fragment containing natl flanked by 80 bp of upfl 



homologous sequences was created by polymerase chain 
reaction (PCR) using pCR2.\-nat (pHL2621) as the 
template and HL2252 and HL2253 as the primers. To 
induce homologous recombination between upfl- and 
the «a?/-containing fragment, 5 ug of this natl fragment 
was introduced to YHL1101 with the lithium acetate 
transformation method (22). NAT-resistant colonies 
were isolated on YES plates containing 100 ug/ml NAT 
(Werner BioAgents, Germany). DNA blots and PCR were 
performed to confirm the single gene deletion. The abpl 
deletion strain was a gift from Shiv Grewal. Strains con- 
taining Tfl insertions and lacking abpl were obtained by 
standard genetic crosses. 

To create a plasmid with wild-type upfl + , a 3.8 kb 
fragment was amplified by high-fidelity DNA polymerase 
pfx (Invitrogen) with HL2417 and HL2418 and ligated 
into pHL1288 digested with Spel-Sall. The open reading 
frame (ORF) of upfl in the constructed plasmid was con- 
firmed by DNA sequencing. 

Transposition assay 

The transposition assay of Tfl was described previously 
(24). The strains were transformed with the Tfl plasmid 
pHL2541 by selection on EMM lacking uracil. Strains 
were arranged on EMM-Ura+Bl plates in 2 cm 2 
patches. After 2 days of incubation, these plates were 
replica printed to EMM-Ura-Bl to induce transcription 
of Tfl . After 4 days of incubation, the strains were replica 
printed to EMM+Ura+Bl+FOA to select against the Tfl 
donor plasmid. Following 2 days of incubation, transpos- 
ition events were selected by replica printed to 
YES+FOA+G418. 

DNA preparations and blots 

Genomic DNA was extracted from cells using glass beads 
and phenol (25). Ten micrograms of genomic DNA 
isolated from cells (OD 600 = 1.0) were digested by 
Hindlll, and the products were separated on a 1.0% 
agarose gel. The gel was transferred to Genescreen Plus 
(Perkin Elmer), and the filters were hybridized with a 
1.0-kb neo probe derived from a BamHI digest of 
pHL765. 

Determination of Tfl insertion sites by inverse PCR 

Five micrograms of genomic DNA were digested by 
Hindlll. To circularize the fragments, the DNA was 
diluted to 1 ng/ul and 17 ul was ligated with 1 ul contain- 
ing 20 U of T4 DNA ligase (Invitrogen) and 2 ul buffer at 
18°C overnight. Several parallel ligations were performed 
and extracted with phenol-chloroform. One hundred 
nanograms of ligated DNA was amplified by Titanium 
Taq (Clontech) with primers HL2347 and HL1104. 
After gel electrophoresis, the desired products, whose 
sizes were predicted by DNA blots, were gel purified 
(Qiagen), ligated into pCR2.1-TOPO (Invitrogen) and 
sequenced. After determining the sequence of the 
genomic DNA downstream of the 3' LTR, primers were 
designed to PCR amplify the junction between the 
genomic DNA and the 5' LTR. In all cases, 5 bp target 
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site duplications were found, indicating that the Tfl trans- 
posons were introduced by bona fide integration. 

RNA preparations 

RNA was extracted from 40 ml of cells that were har- 
vested at OD 60 o = 1 .0 and resuspended in 400 ul of 
buffer AE (50 mM NaOAc, pH 5.3 and 10 mM EDTA) 
and 40 ul of 10% sodium dodecyl sulfate. The resuspended 
cells were combined with 400 ul of acid phenol (Ambion) 
and 400 ul of acid-washed glass beads (Sigma, G-8772). 
The cells were lysed by lOmin of bead beating (Mini 
Beadbeater, Biospec Products) in cycles of 30 sec on and 
30 sec off. The cell ly sates were cooled at — 80° C for lOmin 
and then centrifuged for 5min at 14000rpm to separate 
the aqueous phase from the beads, phenol and cell debris. 
Phenol-chloroform extractions were repeated until the 
interfaces were no longer white. After adding 40 ul 3 M 
NaOAc and 1ml 100% ethanol, RNA was precipitated 
and stored at — 80°C until use. For cells exposed to 
stress, cultures with OD 600 = 1.0 were harvested quickly 
before and at 15 and 60min after exposed to stress. 

Quantitative reverse transcription PCR 

Quantitative reverse transcription PCR (qRT-PCR) was 
conducted and analyzed as described previously (26). 
Briefly, 44 ul of total RNA (lower than 200ng/ul) was 
treated with 1 ul of TURBO DNase (Ambion) in 5 ul 
buffer to remove any residual genomic DNA. Following 
incubation for 30min at 37°C, the reaction was mixed 
with DNase inactivation reagent (Ambion). Then, 500 ng 
of total RNA was reverse transcribed into cDNA using 
random primers with the high-capacity cDNA reverse 
transcription kit (Applied Biosystems). As a control for 
contamination by genomic DNA, the reaction lacking 
reverse transcriptase was also subjected to qRT-PCR. 
After cDNA was synthesized and diluted to about 
lOng/ul, 6ul of this cDNA was added into 1 well of a 
96-well plate and mixed with lOul of 2X Power SYBR 
green PCR master mix (Applied Biosystems) and 4ul of 
gene specific primer sets (each primer is 500 nM). The 
sample was run in the Applied Biosystems StepOnePlus 
Real-Time PCR System according to their instructions. 
All primer sets were validated for their PCR efficiency 
by serial dilution of standard cDNA. Efficiency values 
of 1.8-2.2 were obtained. Each sample from three inde- 
pendent yeast colonies was quantified and averaged as 
triplicates. Expression of genes in all conditions was 
normalized to actl. Relative changes in expression of the 
genes were determined by the comparative Ct (2~ ct ) 
method (26). For the strand-specific qRT-PCR, total 
RNA was reverse transcribed into cDNA using strand- 
specific primer with MultiScribe reverse transcriptase 
(Applied Biosystems). The conditions of other reactions 
were the same as the normal qRT-PCR. The sense or anti- 
sense RNA of tested genes was normalized to sense RNA 
of actl to determine the presence and relative levels. The 
reaction without reverse transcriptase was conducted for 
the control of genomic DNA, and the reaction without any 
primer was a control of self-priming of RNA during RT. 



A strand-specific control was also performed using the 
specific primer for sense or antisense RNA of actl. 

RNA blots 

RNA blots were performed as described previously (20). 
Ten micrograms of total RNA was separated on a 1.0% 
agarose-formaldehyde denaturing gel. After the RNA was 
transferred to a nylon membrane, it was hybridized with 
random primed DNA probes (Roche), oligonucelotide 
probes or RNA probes synthesized by in vitro transcrip- 
tion (Ambion). As a loading control, the blots were either 
stained with ethidium bromide for the ribosomal RNA 
(rRNA) or hybridized to a probe of actl. The actl 
probe was produced from a 1 kb Eco RI-Bam HI 
fragment of pHL859-l. The levels of RNA from three 
independent yeast colonies were quantified with 
phosphoimaging and averaged. Expression of genes was 
normalized to rRNA in the stress conditions and 
normalized to actl in other conditions. 

Rapid amplification of cDNA ends 

The 5' and 3' rapid amplification of cDNA ends were 
assayed by the RNA ligase-mediated RACE kit 
(Ambion) according to the manufacturer's instruction. 
After nested PCR and separation in a 2% agarose gel, 
the desired and specific DNA products were gel purified 
and ligated with pCR2.1-TOPO vectors (Invitrogen) and 
sequenced from both ends. Only high-quality and specific 
sequences were analyzed. 

RESULTS 

Integration of Tfl into Pol II transcribed promoters did 
not reduce promoter function 

The integration of Tfl into pol II transcribed promoters 
has the potential to disrupt the expression of the genes on 
either side of the insertions (20). To investigate the effect 
of Tfl integration on the expression of flanking genes, we 
generated a collection of strains that had single copy in- 
sertions of Tfl . To create these strains, Tfl containing a 
neo gene (Tfl-«eo) was expressed from a plasmid, and cells 
that obtained inserts were isolated on a medium contain- 
ing G418 (24,27). The insertions were generated in a strain 
that had no pre-existing copies of Tfl. For reasons ex- 
plained in a later section, this strain contained a deletion 
of upfl (upflA) and a plasmid copy of upfl + to comple- 
ment the deletion. 

From the strains that became resistant to G418, 14 were 
selected at random and analyzed for the presence of Tfl - 
neo insertions. DNA blots showed that each strain con- 
tained a single copy of Tfl-neo (Supplementary Figure 
SI). Inverse PCR was used to sequence the position of 
the insertions, and these data revealed that each integra- 
tion mapped to a unique site in the genome. All inserts of 
Tfl were flanked by 5 bp duplications, a signature struc- 
ture demonstrating that each insertion was catalyzed by 
the Tfl integrase (28). Table 1 lists each insertion, its 
orientation and the genes on either side of the integration. 



778 Nucleic Acids Research, 2013, Vol. 41, No. 2 



Table 1. Effect of Tfl integration on expression of adjacent genes in the strains with and without apfl 
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0.96+0.38 


0.7010.15 




SPNCRNA.506 








24 


2.61+0.71 


2.83+0.67 




SPCC74.04 








4 675 


1.13+0.19 


0.94+0.09 


5 


SSCll 


68 


1 




961 


2.00±0.29 


1.80+0.48 




SPNCRNA.81 1 








-225 


2.15+0.36 


2.39+0.62 










^ y 


264 


1 .24±0.22 


1.35+0.14 


6 


SPCC1020.05 


157 


3 




346 


1.12±0.21 


1.2910.31 




Tf2-12 








1 215 


n.d. 


n.d. 


7 


SPAC24B 11.08c 


247 


1 


{ ^ ^ 


2 623 


1.05±0.12 


1.0210.14 




SPNCRNA.623 








44 


1.31+0.39 


2.2410.62 




SPAC24B1 1.09 






^ ^ 


655 


1.30±0.14 


1.36+0.09 


8 


ish 1 


434 


2 




553 


0.99±0.22 


0.6010.03 




hbal 








714 


0.97+0.20 


0.9810.37 


9 


spn 7 


439 


2 




862 


1.03+0.21 


1.31+0.57 




SPBC19F8.02 








166 


1.26±0.22 


1.3910.34 


10 


si Hi 4 


533 


2 




939 


1.14+0.39 


0.8410.33 




mil 001 






^ — y 


176 


1.27±0.24 


1.2410.16 


11 


SPBC4F6.05c 


537 


2 


<- <- 


342 


1.15+0.19 


1.2910.28 




kinl 






< — > 


1654 


0.8310.16 


0.7810.30 


12 


tif512 


785 


2 


< — > 


184 


1.0510.12 


1.1110.25 




vps52 






— > — » 


220 


0.8410.29 


0.6010.20 


13 


cssl 


1162 


2 




565 


1.1610.32 


1.13+0.35 




reel 4 






< — > 


861 


0.97+0.24 


1.2910.54 


14 


SPAC19A8.06 


2114 


1 




599 


0.8810.28 


0.6910.17 




sst4 






< — > 


360 


0.89+0.21 


1.33+0.27 



"Red arrow represents Tfl; black arrow represents protein-coding gene; green arrow represents non-coding RNA gene. The strand of 
the non-coding RNA genes depicted (SPNCRNA.628, SPNCRNA.506 and SPNCRNA.811) was based on strand- specific qRT-PCR. 
For SPNCRNA.623, the transcribed strand was determined with a strand-specific RNA blot. 

b Distances between Tfl inserts and coding genes were measured from the end of Tfl to the ORF. The distances to non-coding RNA 
genes were measured to the ends of the non-coding transcripts. The locations of the ORFs. SPNCRNA.l 164 and SPNCRNA.811 were 
based on the PomBase annotations (31). The distances between the non-coding RNA genes SPNCRNA.628, SPNCRNA.506, 
SPNCRNA.623 and Tfl were based on the results of 5' and 3' RACE (Supplementary Figure S3B, S3C and S3H). The negative 
numbers of distances indicates that the Tfl insertion was located inside of the gene. 
c Values are results of qRT-PCR and are the average of three independent experiments (mean ± SD). 



Of 73 125 insertions of Tfl that were sequenced previ- 
ously, 93% occurred in intergenic regions containing pol 
II transcribed promoters (21). Approximately 1000 
promoter regions are preferred for integration, and 
based on their specific level of integration, each intergenic 
region was assigned a ranking. All 14 insertions in this 
study occurred in intergenic sequences containing pol II 
promoters, and 12 of the 14 were positioned in intergenic 
regions that were ranked among the top 1000 targets of 
integration in the S. pombe genome (Table 1). As a result, 
the impact of these insertions on the expression of the 
adjacent genes has particular relevance. 

To test systematically the effect of Tfl insertion on the 
expression of adjacent genes, we used qRT-PCR and 



measured the relative expression of genes adjacent to 
Tfl-weo and compared this with the expression of the 
genes in the parent strain that lacked the insertions. 
In this study, we measured the mRNA from the coding 
and non-coding genes on either side of the insertions 
except for that of Tf2-12 (insertion #6), which as a repeat 
element could not be independently analyzed. Five of the 
intergenic sequences with insertions are reported to produce 
non-coding RNAs (ncRNAs) (29,30). Although it is not 
known whether these ncRNAs possess function, we 
measured these transcripts as well. As a result, we moni- 
tored a total of 32 genes (Table 1). Although the insertion 
of 6 kb of transposon DNA into promoter regions might be 
expected to damage promoter function, none of the 32 
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genes exhibited reduced expression resulting from the inser- 
tion of Tfl (Table 1, relative fold expression, upfl + ). The 
lack of genes with reduced expression is unlikely due to 
negative selection during isolation of the insertion strains 
because these intergenic sequences were also common in- 
sertion sites in diploid strains (21). 

Integration of Tfl can enhance the expression of 
adjacent genes 

Insertion of Tfl-neo did increase the expression of four 
genes adjacent to integration sites. In the case of ssal, a 
gene encoding heat-shock protein 70 (Hsp70), an insertion 
961 bp upstream of the ORF enhanced its expression by 
2.0-fold. In addition, the expression of three ncRNA genes 
SPNCRNA.628, SPNCRNA.506, and SPNCRNA.811 
were enhanced 2.6, 2.6 and 2.1-fold, respectively 
(Table 1). The results of strand-specific qRT-PCR 
revealed that SPNCRNA.628 and SPNCRNA.506 are 
transcribed on the top strand, and contrary to annotations 
of the reference genome, SPNCRNA.811 was transcribed 
on the bottom strand of the reference genome 
(Supplementary Table S4) (31). Together, these results 
demonstrate that Tfl insertion had the capacity to stimu- 
late the expression of adjacent genes. 

Host surveillance mechanisms can restrict the impact of 
Tfl integration on the expression of adjacent genes 

The finding that Tfl insertion enhanced the expression of 
4 adjacent genes led us to question why the other 28 were 
not activated. One possibility we considered was that the 
expression of ORFs adjacent to Tfl could potentially be 
enhanced by mRNA originating from Tfl but that much 
of the mRNA was degraded by the nonsense-mediated 
decay (NMD) pathway, the system that degrades 
aberrant mRNA (32,33). Indeed, mRNA originating 
from a closely related transposon Tf2 does read into 
adjacent sequence and in some cases activates the expres- 
sion of neighboring genes (10). This type of read-through 
transcript could have premature stop codons that would 
trigger NMD. To make it efficient to test whether NMD 
might restrict the expression of ORFs adjacent to Tfl , the 
original insertions were created in a strain that had a 
deletion of upfl, the gene encoding a key factor required 
for NMD. To complement the deletion, the strain also 
contained a plasmid-encoded copy of upfl. qRT-PCR 
revealed that the plasmid copy of upfl produced levels 
of mRNA similar to that produced by a single genomic 
copy of upfl (data not shown). The expression studies 
described above were all done with strains containing 
the deletion of upfl and the plasmid encoded upfl. 

We tested NMD for a potential role in restricting the 
expression of the genes adjacent to Tfl by measuring the 
impact of Tfl insertion on mRNA levels in strains that 
lacked both chromosomal and plasmid copies of upfl. 
Surprisingly, qRT-PCR revealed that the expression of 
31 of the adjacent genes did not change when upfl was 
absent (Table 1). Although NMD may not play a frequent 
role in reducing the RNA of genes next to Tfl, we found 
that one Tfl insertion did enhance RNA levels of 



Table 2. The effect of Tfl integration on expression of adjacent 
genes in strains with and without abpl 
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The RNA levels were measured by qRT-PCR from three independent 
strains and averaged. 



SPNCRNA.623 2.2-fold with an increase that was 
specific for cells lacking upfl (Table 1, insertion #7). 

Another host factor that may limit the ability of Tfl to 
enhance expression of adjacent genes is the CENP-B 
homolog Abpl. The previous work showed that Abpl 
plays a role in clustering Tfl and Tf2 elements into 
nuclear bodies and inhibiting the expression of the trans- 
posons by recruiting histone deacetylases (HDACs) (34). 
In at least one case, Abpl inhibits the expression of a gene 
next to an existing LTR (34). To investigate directly 
whether Abpl limits the ability of de novo insertions to 
increase the expression of adjacent genes, we crossed six 
strains containing Tfl insertions with a strain lacking 
abpl. We then used qRT-PCR to measure whether Tfl 
insertion in strains lacking abpl enhanced the expression 
of the adjacent genes. These strains all contained the single 
chromosomal allele of upfl. Interestingly, we found that in 
cells lacking abpl, Tfl insertion did increase the expres- 
sion of spn7, cssl and SPBC4F6.05c, whereas Tfl integra- 
tion had no effect on expression of these genes in cells 
containing abpl (Table 2). The activation of ssal and 
SPNCRNA.811 observed in cells with abpl was not 
significantly changed in the cells that lacked abpl. 
These results indicated that Abpl did play a role in in- 
hibiting the ability of Tfl to enhance the expression of 
some adjacent genes. 

Conditions of heat and oxidative stress were required for 
Tfl to increase the expression of some adjacent genes 

Tfl integration favors promoters of stress response genes, 
and several of the genes targeted by Tfl in this study are 
reported to be induced by heat or oxidative stress. As a 
result, we tested whether conditions of stress would reveal 
or enhance the ability of Tfl sequences to increase the 
expression of adjacent genes. All 14 strains with a Tfl 
insertion were incubated at 39°C for 15 and 60min, and 
qRT-PCR of their RNA revealed the expression of four 
genes, SPNCRNA.506, SPNCRNA.811, spn7 and cssl, 
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was enhanced by adjacent Tfl inserts (Table 3). Although 
expression of SPNCRNA.506 and SPNCRNA.811 was 
enhanced by Tfl in cells grown in regular conditions 
(Tables 1 and 3), their enhancements were increased by 
the heat treatments, 2.2- and 2.7-fold, respectively. In 
contrast, the increase of spn7 and cssl expression by Tfl 
sequence required heat stress (Tables 1 and 3). An anti- 
sense ncRNA gene (SPNCRNA.1588) has been predicted 
to overlap spn7 (29). However, we used strand-specific 
qRT-PCR and found that it was the sense RNA of spn7 
that was enhanced by adjacent Tfl sequence after heat 
treatment (Table 3). Despite being enhanced by Tfl in 
cells grown at regular temperature, the expression of 
genes ssal (Supplementary Figure S2) and 
SPNCRNA.628 was not increased by Tfl in cells 
exposed to heat treatment (Table 3). 

Many genes of S. pombe induced by heat treatment are 
also induced by oxidative stress (9). Therefore, we tested 
whether the genes activated by adjacent Tfl sequence by 
heat treatment were also induced by oxidative stress. The 
expression of genes enhanced by Tfl insertion when 
exposed to heat was also enhanced when cells were 
treated with H 2 0 2 (Table 4). For SPNCRNA.506 and 
SPNCRNA.811, the levels were significantly greater 



when cells were treated with H 2 0 2 compared with heat 
treatment. 

Although environmental stress allowed Tfl sequences 
to increase the expression of 4 of the genes studied, the 
expression of the other 28 genes adjacent to Tfl insertions 
was unchanged with these conditions (Tables 3 and 4). We 
tested whether the expression of genes unaffected by Tfl 
tested during stress might be limited by NMD. 
Nevertheless, the expression of SPNCRNA.623, vps52 
and recl4 in upflA cells remained unchanged by Tfl in- 
sertion in cells treated with heat (data not shown). 
Collectively, these data indicate that the ability of Tfl to 
increase the expression of adjacent genes can be mediated 
by environmental stress. 

The transcription of Tfl is induced by environmental 
stress 

To understand how Tfl sequences might enhance the ex- 
pression of adjacent genes, we tested whether the 
promoter of Tfl itself responded to environmental 
stress. Using qRT-PCR, we measured the expression of 
Tfl in the strain that contained a single-copy Tfl 
adjacent to ssal (insertion #5). This copy of Tfl was 
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Table 4. Effect of Tfl integration on expression of adjacent genes under oxidative stress 
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induced substantially when cells were heat treated and to a 
lesser extent when subjected to oxidative stress (Figure 1A, 
heat vs. H 2 0 2 ). In comparison, osmotic stress and 
exposure to a heavy metal did not induce this Tfl 
element. To determine whether the stress-mediated induc- 
tion of Tfl mRNA was dependent on the location of the 
transposon, we surveyed Tfl mRNA levels produced from 
all 14 insertion positions. Although the expression of Tfl 
at all 14 positions was induced by heat treatment, Tfl 
mRNA from insertions #5 and #10 increased more than 
10-fold (Figure IB). In the five different insertion pos- 
itions tested, Tfl mRNA was also induced by oxidative 
stress, indicating that both heat and oxidative stress trig- 
gered increased transcription of Tfl regardless of the 
position of the transposon (Figure 1C). 

Tfl increases the expression of adjacent genes by 
providing enhancer activity 

The enhanced expression of genes by adjacent copies of 
Tfl may have resulted from transcription that initiated 
from within the transposon. Alternatively, Tfl may 
contain enhancer elements that could have increased the 
activity of promoters that pre-existed the insertion event. 
To distinguish between these possibilities, we ran RNA 
blots and tested whether mRNAs with levels enhanced 
by Tfl insertions were the same size as those produced 
in the absence of Tfl . An RNA blot of insertion strain 
#5 and the parent lacking the insertion confirmed that the 
insertion of Tfl increased the expression of ssal by 
2.0-fold (Figure 2A). More importantly, the ssal mRNA 
produced by insertion strain #5 co-migrated with the ssal 
mRNA from the strain that lacked the Tfl insertion. The 
results of 5' RACE experiments and the sequencing of the 
RACE products revealed that the ssal mRNA increased 
by the Tfl insertion initiated from the same nucleotide as 
the mRNA produced by the strain lacking the insertion, 
211 bp upstream of the ORF (Figure 2B and 
Supplementary Figure S3 A). This result indicated that 
the insertion of Tfl increased the transcription of ssal 
from its natural start site. 

Ssal is a member of the Hsp70 superfamily of stress- 
induced protein chaperones. This raised the possibility 



that ssal expression was increased not because it was 
adjacent to Tfl sequence but because insertion of the 
transposon resulted in expression of Tfl protein that in 
turn triggered induction of stress response proteins such as 
Ssal. If true, increases of ssal expression would occur in- 
dependent of the position of Tfl integration. To test this 
possibility, we measured the ssal mRNA produced by all 
14 of the insertion strains. RNA blots revealed that inte- 
gration adjacent to ssal (insertion #5) was the only 
position of Tfl that increased ssal expression 
(Figure 2C). This result demonstrated that the enhanced 
expression of ssal was the direct result of the proximity of 
Tfl sequence to ssal. 

To identify the mechanism that enhanced the expression 
of ncRNA, we analyzed RNA from the insertion strain #3 
on blots. This RNA blot confirmed the qRT-PCR result 
that insertion of Tfl did increase the expression of 
SPNCRNA.628 (Supplementary Figure S4A and S4B). 
The blot showed that the increase was 3.7-fold. The 
enhanced SPNCRNA.628 RNA co-migrated with the 
SPNCRNA.628 RNA expressed by the parent strain 
that lacked the insertion. This indicates that Tfl 
sequence can also increase the expression of ncRNA 
genes by providing enhancer activity. Our RACE 
analysis of the 5' start of transcription revealed that the 
SPNCRNA.628 RNA enhanced by Tfl initiated from its 
natural start site (Supplementary Figures S3B and S4C). 
Interestingly, 3' RACE assays revealed that Tfl inserted 
within SPNCRNA.628 28 bp from the end of the gene 
(Supplementary Figures S3B and S4C). The result of this 
insertion was the SPNCRNA.628 RNA terminated at 
base 262 of the Tfl LTR. 

To determine how Tfl enhances the expression of 
adjacent genes in response to environmental stress, we 
again characterized mRNAs. RNA blots of cells heat 
treated for 60 min showed that the spn7 mRNA increased 
by Tfl co-migrated with spn7 mRNA produced by the 
strain that lacked the insertion (Figure 3A). The same 
cells exposed to oxidative stress generated increased 
levels of spnl mRNA that co-migrated with the spnl 
mRNA from cells lacking the insertion (Figure 3B). 
In cells exposed to heat or oxidative stress, the cssl 
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mRNA enhanced by insertion of Tfl also co-migrated 
with cssl from cells lacking the insertion (Figure 3C and 
D). In each case, regardless whether cells were treated with 
heat or oxidative stress, 5' RACE followed by sequencing 
showed that the spn7 and cssl mRNAs with levels 
increased by Tfl started from their natural initiation 
sites (Figure 3E and F and Supplementary Figures S3D 
and S3E). 

In experiments described earlier, Abpl was found to 
restrict the Tfl -mediated increase in the expression of 
spn7, SPBC4F6.05c and cssl (Table 2). To determine 
what type of enhancement mechanisms were restricted 
by Abpl, we again characterized the mRNAs. In the 
absence of abpl, the cssl and SPBC4F6.05c mRNAs 
increased by adjacent copies of Tfl co-migrated with the 



RNAs from the strain lacking Tfl (Figure 4A and B). 
Again, the results of 5' RACE demonstrated that the 
enhanced mRNAs start from the same nucleotides as the 
mRNAs from the strain that lacked Tfl (Figure 4C and D 
and Supplementary Figure S3F and S3G). These data 
indicate that the enhanced expression of genes by 
adjacent copies of Tfl results from increased activity of 
the genes' promoters. This was true regardless of whether 
the increases were mediated by environmental stress or the 
absence of the genome surveillance protein Abpl. 

The ability of Tfl to increase the activity of adjacent 
promoters suggests that the transposon possesses 
enhancer elements. Previously published data indicated 
that Abpl recruits HDACs to LTRs of Tfl and Tf2, 
and to full-length Tf2s, retrotransposons that are closely 
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related to Tfl and reside in laboratory strains of S. pombe 
(34). The deacetylases, Clr3 and Clr6, inhibit the transcrip- 
tion of the Tf2s in the laboratory strain and a de novo 
inserted Tfl at SPAC7D4.08. If Tfl increases the expres- 
sion of adjacent genes by providing enhancer activity, the 
inhibitory activity of Abpl on spn7, SPBC4F6.05c and 
cssl would likely act on the promoters of the Tfl 
adjacent to these genes. We did find that expression of 
the Tfl copies adjacent to spn7, SPBC4F6.05c and cssl 
was significantly restricted by Abpl (Supplementary 
Figure S5, insertions #9, #11 and #13). Three other pos- 
itions of Tfl insertion were also tested, and in each case, 
Abpl inhibited Tfl expression (Supplementary Figure S5). 

Upfl degrades SPNCRNA.623 RNA enhanced 
by Tfl insertion 

One possible mechanism that could cause Tfl insertions to 
increase the expression of adjacent genes would be the 
transcription of RNA that initiates in Tfl continues into 
the coding sequences of neighboring genes. Although this 
mechanism was not observed, we tested whether these 



read-through RNAs might be generated by Tfl but 
degraded by the NMD pathway. Although NMD did 
not play a general role in restricting the enhancement of 
genes by Tfl, the expression of 1 of the 32 genes studied, 
SPNCRNA.623, was enhanced by an adjacent copy of Tfl 
when upfl was deleted (Table 1, insertion #7). RNA blots 
demonstrated that SPNCRNA.623 was increased 4.3-fold 
by the adjacent copy of Tfl, and this increase was only 
observed in the absence of upfl (Figure 5A and B). 
Although this result suggested that Upfl had degraded 
an RNA initiated in Tfl that continued into 
SPNCRNA.623, strand-specific probes of the RNA blot 
demonstrated that SPNCRNA.623 is a plus strand tran- 
script and therefore could not have been encoded by 
an RNA initiated from the downstream Tfl (Figure 5A 
and B). 

To understand how Tfl increased transcription of 
SPNCRNA.623 and why the NMD pathway degraded 
this RNA, the start and termination sites of this transcript 
were mapped with 5' and 3' RACE. The sequences of the 
5' RACE products revealed that the RNA of 
SPNCRNA.623 started at the same site regardless of 
whether Tfl was present or whether upfl was deleted 
(Figure 5C and Supplementary Figure S3H). However, 
3' RACE assays indicated that the adjacent copy of Tfl 
caused the SPNCRNA.623 RNA to extend past its normal 
termination site and end 262 bp into the 5' LTR 
(Figure 5C and Supplementary Figure S3H). This termin- 
ation site in the LTR corresponded to the same position of 
the 3' end of the SPNCRNA.628 RNA that continued into 
the Tfl sequence (Supplementary Figure S4C), indicating 
that the termination is mediated by a signal in the LTR. 
The extended version of the SPNCRNA.623 RNA had 
premature termination codons that would be recognized 
by the NMD pathway. The termination codons would 
explain why Upfl degraded the extended version of the 
SPNCRNA.623 transcript. Despite the 3' extension of the 
transcript, the RNA with levels enhanced by Tfl initiated 
from its natural start site, indicating that the insertion 
increased the amount of the RNA by providing 
enhancer activity. 

Genes with expression increased by Tfl insertions are 
stress response genes 

Of the 32 genes analyzed in this study, 6 had expression 
enhanced by Tfl sequence, in cells grown at 32° C or in 
response to heat treatment (Tables 1 and 3). Results of 
RNA blots and RACE assays indicated that the transcrip- 
tion of these genes was increased by enhancer activity 
carried in Tfl. One key question was why did the 
enhancer activity in Tfl not increase the transcription of 
the other 26 genes. One possibility was that the chromatin 
context of each insertion imposed varying degrees of 
access to transcription factors. If the enhancer function 
of some Tfl copies were impaired, these insertions 
would not be expected to increase expression of adjacent 
genes. To test this possibility, we asked whether there was 
a correlation between the genes with expression increased 
by Tfl sequences and the transcription levels of the 
adjacent copies of Tfl. qRT-PCR revealed that the 
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Figure 3. Tfl insertion increased the expression of spnl and cssl when cells were exposed to stress. (A and B) The increase in spnl mRNA caused by 
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copies of Tfl of insertion #7 and #10 were expressed 10 
and 7 times higher than the other inserts, respectively 
(Supplementary Figure S6A). None of the six genes with 
expression enhanced by Tfl were adjacent to inserts #7 or 
#10. This poor correlation suggests that it was not a vari- 
ation in the enhancer activities of the Tfl inserts that 
determined which adjacent genes had increased expres- 
sion. In addition, the transcription levels of the cellular 
genes in the strain that lacked Tfl insertions were no 
higher for the genes that had expression increased by 
Tfl (Supplementary Figure S6B), indicating that it was 
not the chromatin context of the insertion sites that 
determined which genes had expression increased by Tfl 
insertions. We also recognized that it was not solely 
the Tfl elements with the greatest response to stress 



(Figure IB, insertions #5 and #10) that determined 
which genes in stress conditions had increased expression 
when adjacent to Tfl (Table 3). Thus, the Tfl promoters 
were not the only factors responsible for increased expres- 
sion of adjacent genes. 

The increased expression of adjacent genes by Tfl 
enhancer elements may require interactions between the 
transcription factors bound to the Tfl enhancer and 
those bound to the adjacent promoter. This interaction 
would be more likely to occur if the transcription factors 
bound at the two sites were the same protein or closely 
related. The promoters of all 14 Tfl copies were induced 
by heat (Figure IB). To determine which of the 32 genes 
flanking Tfl insertions might be regulated by the same 
stress response factors as Tfl, we asked which genes 
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the products. 



were induced by heat or oxidative stress independently of 
Tfl. In the strain lacking Tfl insertion, 8 of the 32 genes 
were induced 2-fold or more by heat treatment 
(Supplementary Table S5). Most of the genes tested were 
also induced by oxidative stress. Importantly, all six genes 
that had expression increased by adjacent Tfl sequences 
were among the eight genes activated by heat. The correl- 
ation between the genes with expression enhanced by Tfl 
and those that are stress induced is strong as all the genes 
with expression increased by Tfl were also stress induced. 
(The Fisher's exact test two-tailed P-value is less than 
0.0001.) The strength of this correlation argues that 
inserts of Tfl can enhance the transcription of adjacent 
genes only if the genes have stress response promoters. 

To examine the promoters that had expression 
enhanced by Tfl for stress response elements, we 
analyzed their sequences for common motifs. The 
motif-based sequence analysis tool MEME (35) identified 
a motif present in all six promoters that had activity 
increased by Tfl and in the Tfl promoter itself 
(Supplementary Figure S7A-S7C). Importantly, the 
search tool FIMO found that this motif was not present 
in 25 of the 26 genes not affected by Tfl insertion (36). 
Although there is little information about the sequences 
bound by the transcription factors of S. pombe, we used 
the motif comparison tool TOMTOM (37) to compare the 
motif present in Tfl and the promoters activated by Tfl to 
specific sequences bound by proteins in S. cerevisiae. 
We found the motif was significantly similar to the 



sequence bound by Skol of S. cerevisiae, a ATF/CREB 
stress response factor (P = 0.038) (Supplementary Figure 
S7D). What is more, the sequence bound by Skol in S. 
cerevisiae (TGACGT, Supplementary Figure S7D) is very 
similar to the sequence (TGACGT) (38) bound by the 
ATF/CREB stress response factor Atfl of 5*. pombe. 
Together these analyses indicate that the promoters 
enhanced by Tfl insertions and the Tfl promoter itself 
share a common motif that may be recognized by an 
ATF/CREB stress response factor. 



DISCUSSION 

Integration of transposons has the potential to destroy 
coding sequences throughout the genome of the host 
cell. However, because the fate of transposons is inexor- 
ably tied to that of the cell, a variety of mechanisms have 
evolved that direct integration to regions of the genome 
that lack coding sequences. In S. cerevisiae, Ty5 integrates 
into regions of heterochromatin and Tyl and Ty3 insert 
into the gene-free sequences upstream of pol III 
transcribed genes (12,14,39,40). In plants, many trans- 
posons accumulate in heterochromatin, and in the case 
of the chromovirus transposons and Tall, this distribution 
likely resulted from integration mechanisms that recognize 
histone modifications (41,42). In light of the mechanisms 
that direct integration to gene free sequences, it is 
surprising that Tfl integrates in the promoters of pol II 
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Figure 5. Upfl restricted the increased expression of SPNCRNA.623 by the enhancer activity of Tfl . (A) The orientation and position of the Tfl 
insertion between SPNCRNA.623 and SPAC24B11.09 is shown. The sequence of the probe is represented by a line underneath SPNCRNA.623. The 
distances from the Tfl insertion to the adjacent ORFs is shown above the line. (B) The levels of SPNCRNA.623 RNA were measured from strains 
that lacked Tfl or contained Tfl inserted downstream of SPNCRNA.623. The RNA blots were hybridized with strand-specific probes to determine 
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transcribed genes (20,43). Integration in promoters was 
expected to disrupt recognition elements and reduce tran- 
scription. Also, Tfl insertions were expected to reduce 
expression of adjacent genes because the binding of 
Abpl to Tf LTRs recruits HDACs that not only reduces 
transcription of the LTRs but also has been found to 
inhibit expression of an adjacent gene (34). Indeed, we 
confirmed that Abpl inhibited Tfl transcription, and in 
three cases, Abpl reduced the transcription of genes next 
to de novo inserted copies of Tfl. Nevertheless, our study 



revealed that insertion of Tfl next to 32 different genes did 
not reduce the expression of any of those genes. Saturated 
profiles of integration levels at each promoter of S. pombe 
revealed that the 32 promoters studied here are among the 
most common sites of Tfl integration (21). As a result, our 
findings indicate that the bulk of Tfl integration events do 
not reduce the expression of adjacent genes. 

Tfl integration increased the expression of 6 of 32 genes 
studied, and if not, for the restrictive action of the cellular 
genes upfl and abpl, Tfl would have enhanced the 
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expression of 2 additional genes. The LTRs of Tf2 are 
transcribed (30,34,44-46), and in rare cases, the RNAs 
can continue into adjacent sequence and read-through 
neighboring genes (10). However, in the examples 
studied here, the increases in mRNA of genes caused by 
Tfl insertion were not the result of read-through tran- 
scripts. Instead, the results of RNA blots and 5' RACE 
assays revealed that Tfl carried enhancer activity that 
increased the promoter activity of genes adjacent to Tfl. 

One significant question is why did Tfl insertion 
enhance the expression of some genes and not others. 
We found that genes induced by heat were the only 
genes with expression that was increased by Tfl insertion. 
This together with our finding that the transcription of Tfl 
itself was induced by heat suggests that the Tfl enhancer 
and the stress response genes were recognized by the same 
or similar activators of transcription. Indeed, the identifi- 
cation of a motif common to the promoters of Tfl and the 
genes that had expression increased by Tfl insertions 
support this model. Similar factors bound to adjacent se- 
quences often multimerize and cause synergistic increases 
in transcription. With this model, the insertion of Tfl next 
to a stress response gene would be effectively increasing 
the number of binding sites for the same transcription 
activators and thus stimulate transcription. 

Four hundred twenty-seven ncRNA genes have been 
detected in S. pombe (30). The significance of these genes 
is an important question that is being actively studied. It is 
interesting that of the 14 Tfl insertions chosen at random 
for this study, 5 were adjacent to ncRNA genes. And these 
five were associated with the most common positions of 
integration (Table 1). We also found that the expression of 
three of the five non-coding genes was increased by Tfl 
insertion. These observations together with the result that 
the expression of these three ncRNA genes was induced by 
heat indicates that the non-coding genes are among the 
stress response genes that are targeted by Tfl . These data 
suggest that some ncRNA genes may have a function 
related to stress response and the ability of Tfl to 
increase the expression of these genes may have a biolo- 
gical impact. 

Although retrotransposons and retroviruses have previ- 
ously been shown to enhance expression of genes pos- 
itioned at insertion sites, there are no estimates for what 
fraction of insertions cause changes in expression of genes 
at the target sites. In the cases where retrovirus or retro- 
transposons increase expression of adjacent genes, the 
events are rare insertions isolated with strong conditions 
of selection and obtained only after many generations of 
cell division. For example, after long latency periods of 
infection, mouse mammary tumor virus induces adenocar- 
cinomas of the mammary gland by activating the expres- 
sion of oncogenes at insertion sites (47-50). In 
S. cerevisiae, insertions of Tyl occurs upstream of tRNA 
genes (51-54). But in rare cases, Tyl insertions that 
activate or inactivate the expression of pol II transcribed 
genes can be isolated with strong positive selection where 
growth requires increased or decreased expression of a 
specific gene product (55-58). The availability of a 
genome-wide profile of integration sites and the finding 
that integration of Tfl commonly leads to increased 



expression of adjacent genes demonstrate that a large pro- 
portion of Tfl integration results in the enhanced expres- 
sion of stress response genes. This detailed survey of 
integration events provides a unique understanding of 
how integration impacts the biology of the host cell. 

Her discovery and pioneering studies of transposable 
elements in maize led McClintock to propose that trans- 
posons provide the host with important genetic diversity. 
Her observation that transposon mobility was induced by 
environmental stress led her to formulate the model that 
transposons reorganize the host genome as a means of 
responding to stress (11). The ability of Tfl to increase 
the expression of stress response genes is consistent with 
McClintock's intriguing hypothesis because Tfl insertion 
may protect a fraction of cells subjected to environmental 
stress. Our finding that heat and oxidative stress increased 
expression of Tfl provides additional support for this 
model because increased transcription of Tfl can lead to 
de novo integration (24,28). These observations together 
with the finding that Tfl insertions are directed to stress 
response genes suggest that Tfl may have evolved its tran- 
scription and integration mechanisms to improve the 
survival of cells exposed to environmental stress (1,21). 
Although this ability of Tfl to increase expression of 
stress response genes is an intriguing process, additional 
studies are necessary to test whether these transposon- 
mediated increases in expression actually benefit cells 
when exposed to stress. 
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